Picture for Tong Zhang

Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse

Add code
Mar 24, 2026
Viaarxiv icon

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

Add code
Mar 19, 2026
Viaarxiv icon

What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction

Add code
Mar 02, 2026
Viaarxiv icon

KDFlow: A User-Friendly and Efficient Knowledge Distillation Framework for Large Language Models

Add code
Mar 02, 2026
Viaarxiv icon

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Add code
Feb 25, 2026
Viaarxiv icon

SciFlow-Bench: Evaluating Structure-Aware Scientific Diagram Generation via Inverse Parsing

Add code
Feb 10, 2026
Viaarxiv icon

DICE: Disentangling Artist Style from Content via Contrastive Subspace Decomposition in Diffusion Models

Add code
Feb 08, 2026
Viaarxiv icon

Humanoid Manipulation Interface: Humanoid Whole-Body Manipulation from Robot-Free Demonstrations

Add code
Feb 06, 2026
Viaarxiv icon

GT-SVJ: Generative-Transformer-Based Self-Supervised Video Judge For Efficient Video Reward Modeling

Add code
Feb 05, 2026
Viaarxiv icon

Mitigating Hallucinations in Video Large Language Models via Spatiotemporal-Semantic Contrastive Decoding

Add code
Jan 30, 2026
Viaarxiv icon